Planning with Pixels in (Almost) Real Time

Authors

  • Wilmer Bandres
  • Blai Bonet
  • Hector Geffner
Abstract

Recently, width-based planning methods have been shown to yield state-of-the-art results in the Atari 2600 video games. For this, the states were associated with the (RAM) memory states of the simulator. In this work, we consider the same planning problem but using the screen instead. By using the same visual inputs, the planning results can be compared with those of humans and learning methods. We show that the planning approach, out of the box and without training, results in scores that compare well with those obtained by humans and learning methods, and moreover, by developing an episodic, rollout version of the IW(k) algorithm, we show that such scores can be obtained in almost real time.
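As an illustration of the width-based idea behind IW(k), the following is a minimal sketch of plain breadth-first IW(1), which prunes any generated state that does not make at least one atom true for the first time in the search. It is not the episodic rollout version developed in the paper, and it does not operate on screen pixels: the toy grid domain and the names `iw1`, `grid_successors`, and `grid_atoms` are assumptions made only for this example.

```python
from collections import deque


def iw1(initial_state, successors, atoms, is_goal):
    """Breadth-first IW(1): keep a generated state only if it is novel,
    i.e. it makes at least one atom true for the first time in the search."""
    seen_atoms = set(atoms(initial_state))
    queue = deque([(initial_state, [])])
    while queue:
        state, plan = queue.popleft()
        if is_goal(state):
            return plan
        for action, next_state in successors(state):
            new_atoms = set(atoms(next_state)) - seen_atoms
            if not new_atoms:
                continue  # no novel atom: prune this state
            seen_atoms |= new_atoms
            queue.append((next_state, plan + [action]))
    return None  # goal not reached within width 1


# Hypothetical toy domain: an agent at (x, y) on a 4x4 grid.
SIZE = 4
MOVES = [("right", (1, 0)), ("up", (0, 1)), ("left", (-1, 0)), ("down", (0, -1))]


def grid_successors(state):
    """Return (action, next_state) pairs for moves that stay on the grid."""
    x, y = state
    result = []
    for name, (dx, dy) in MOVES:
        nx, ny = x + dx, y + dy
        if 0 <= nx < SIZE and 0 <= ny < SIZE:
            result.append((name, (nx, ny)))
    return result


def grid_atoms(state):
    """One atom per coordinate value, e.g. ("x", 2) and ("y", 0)."""
    x, y = state
    return [("x", x), ("y", y)]


# A goal on a single atom is reached by IW(1).
plan_single = iw1((0, 0), grid_successors, grid_atoms, lambda s: s[0] == 3)

# A goal that needs two atoms jointly is pruned away by IW(1) in this encoding.
plan_joint = iw1((0, 0), grid_successors, grid_atoms, lambda s: s == (3, 3))
```

In this toy encoding the single-atom goal is reached while the joint goal is not, which shows how aggressively novelty pruning limits the search; larger k values trade that pruning for coverage.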

Journal:
  • CoRR

Volume: abs/1801.03354

Publication date: 2018